Nonparametric estimation of long-tailed density functions and its application to the analysis of World Wide Web traffic

نویسندگان

  • Natalia M. Markovich
  • Udo R. Krieger
چکیده

The study of WWW-traffic measurements has shown that different traffic characteristics can be modeled by long-tail distributed random variables (r.v.s). In this paper we discuss the nonparametric estimation of the probability density function of long-tailed distributions. Two nonparametric estimates, a Parzen–Rosenblatt kernel estimate and a histogram with variable bin width called polygram, are considered. The consistency of these estimates for heavy-tailed densities is discussed. To provide the consistency of the estimates in the metric space L1, the transformation of the initial r.v. to a new r.v. distributed on the interval [0, 1] is proposed. Then the proposed estimates are applied to analyze real data of WWW-sessions. The latter are characterized by the sizes of the responses and inter-response intervals as well as the sizes and durations of sub-sessions. By these means the effectiveness of the nonparametric procedures in comparison to parametric models of the WWW-traffic characteristics is demonstrated. © 2000 Published by Elsevier Science B.V.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Statistical Topology Using the Nonparametric Density Estimation and Bootstrap Algorithm

This paper presents approximate confidence intervals for each function of parameters in a Banach space based on a bootstrap algorithm. We apply kernel density approach to estimate the persistence landscape. In addition, we evaluate the quality distribution function estimator of random variables using integrated mean square error (IMSE). The results of simulation studies show a significant impro...

متن کامل

Representing a method to identify and contrast with the fraud which is created by robots for developing websites’ traffic ranking

With the expansion of the Internet and the Web, communication and information gathering between individual has distracted from its traditional form and into web sites. The World Wide Web also offers a great opportunity for businesses to improve their relationship with the client and expand their marketplace in online world. Businesses use a criterion called traffic ranking to determine their si...

متن کامل

Moment Inequalities for Supremum of Empirical Processes of‎ ‎U-Statistic Structure and Application to Density Estimation

We derive moment inequalities for the supremum of empirical processes of U-Statistic structure and give application to kernel type density  estimation ‎and estimation of the distribution function for functions of observations.  

متن کامل

depth-based nonparametric multivariate analysis and its application in review of new treatment methodology on osteoarthrotic

In this article, first, we introduce depth function as a function for center-outward ranking. Then we present and use half space or Tukey depth function as one of the most popular depth functions. In the following, multivariate nonparametric tests for location and scale difference between two population are expressed by ranking and statistics based on depth versus depth plot. Finally, accord...

متن کامل

Spectral Estimation of Stationary Time Series: Recent Developments

Spectral analysis considers the problem of determining (the art of recovering) the spectral content (i.e., the distribution of power over frequency) of a stationary time series from a finite set of measurements, by means of either nonparametric or parametric techniques. This paper introduces the spectral analysis problem, motivates the definition of power spectral density functions, and reviews...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Perform. Eval.

دوره 42  شماره 

صفحات  -

تاریخ انتشار 2000